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Abstract 

Given the collapse hypothesis (CH) of quantum measurement, EPR- 
type correlations along with the hypothesis of the impossibility of su- 
perluminal communication (ISC) have the effect of globalizing gross 
features of the quantum formalism making them universally true. In 
particular, these hypotheses imply that state transformations of den- 
sity matrices must be linear and that evolution which preserves purity 
of states must also be linear. A gedanken experiment shows that 
lorentz covariance along with the second law of thermodynamics im- 
ply a non-entropic version of ISC. Partial results using quantum logic 
suggest, given ISC and a version of CH, a connection between lorentz 
covariance and the covering law. These results show that standard 
quantum mechanics is structurally unstable, and suggest that viable 
relativistic alternatives must question CH. One may also speculate 
that some features of the hilbert-space model of quantum mechanics 
have their origin in space-time structure. 



1 Introduction 

Inspired by the paper of Einstein, Podolsky, and Rosen and more re- 
cently by the controversy surrounding the Bell inequalities P, |^, many re- 
searchers have tried to interpret long-range quantum correlations as resulting 
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from some sort of action-at-a-distance. A natural question then arises as to 
whether such correlations can be used for superluminal communication. Sev- 
eral authors 0, ^, ^, |^ argued that no such communication is possible since 
the statistical behavior of any detector placed on one arm of an EPR appara- 
tus is completely independent of what is done on the other arm. Nick Herbert 
argued that if one can construct a "photon duplicator" which reproduces 
exactly an incoming photon state, then superluminal communication is pos- 
sible. This argument quickly provoked several rebuttals ||^, |1^, |ll[] to the 
effect that no linear state transformer can clone an arbitrary photon state. 
It seems therefore that ordinary hilbert-space quantum mechanics precludes 
the use of EPR-type correlations for superluminal communication. We ad- 
dress in this section the reciprocal question: under the hypothesis of the 
impossibility of superluminal communication, what can one deduce about 
the behavior of detectors and state transformers? It is a common view that 
superluminal communication should not be possible as it raises serious ques- 
tions of relativistic covariance and causality. These will be further considered 
in Section ^. 

To be able to attack the question at hand we shall initially work with the 
hypothesis that there are possibly some physical processes that may not con- 
form to the usual quantum mechanical description but that these are specific 
to very particular situations, whereas for the vast majority of other processes 
(including all those experimentally studied up to now), any deviation from 
normal quantum mechanical predictions is below present experimental pre- 
cision. Thus one may posit that there may be a state transformer, such as 
the hypothetical photon doner mentioned above, that acts in a non-linear 
fashion, and that such a transformer may take part in an experimental ar- 
rangement in which normal quantum mechanical description is adequate for 
processes not involving it. Explicitly the hypothesis is then that, in any given 
inertial frame, up to the use of an unconventional device, the usual quantum 
mechanical reasoning can be used, including the projection rule. Up to such a 
moment, ordinary quantum mechanics determines what the physical state is. 
At the point of using the unconventional device we of course must posit what 
would happen (a photon would be cloned in the above cited example). We 
shall call allowing the situations described above the neighborhood hypoth- 
esis (NH) since what we are describing, informally speaking, is a situation 
that would neighbor standard quantum mechanics in the set of all possible 
physical theories. 

We shall show in sections § and |^ that under NH, certain types of devi- 
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ation, specifically non-linearities and lack of true randomness of outcomes, 
allow for superluminal signals. These theories are thus to be ruled out if we 
assume the impossibility of superluminal communication (ISC). This makes 
ordinary quantum mechanics a structurally unstable theory. This is impor- 
tant as many proponents of modifications to ordinary quantum mechanics 
are in fact implicitly assuming NH and so face a real risk of coming into 
conflict with special relativity. 

It must be emphasized that NH is essentially an assumption about for- 
malism and not about interpretation. It is an alteration of the formalism 
that is generally part of what is known as the Copenhagen interpretation, 
but we make no interpretational hypotheses. Though state collapse is used, 
we make no assumption as to its ontological nature, only that it is a legit- 
imate calculating device for joint probabilities of events. In the end what 
we are saying is that joint probabilities cannot be calculated by certain rules 
if ISC is to be maintained. This makes our results basically interpretation 
independent. 

It should also be noted that part of our understanding about the standard 
formalism is that it is capable of giving account of a relativistically covariant 
theory. This is not straightforwardly obvious given the instantaneous nature 
of wave function collapse |jl2|, [13|, but this does not preclude lorentz covari- 
ance of observable quantities. What the standard formalism lacks is thus 
manifest covariance while being able to provide for covariance of measur- 
able magnitudes. It is precisely this fact that makes the theory structurally 
unstable, for a perturbation in the formalism is likely to make the mani- 
fest non-covariance capable of producing real effects, such as superluminal 
communication. 

In section ^ we relate ISC and the second law of thermodynamics showing 
that under certain hypotheses superluminal communication can be used to 
foil the second law. In section || we argue that ISC, along with the projection 
postulate, can be used as supporting evidence for assuming certain axioms 
in the foundations of quantum mechanics thus suggesting that quantum me- 
chanics owes some of its aspects to space-time structure. In the last section 
we draw some brief conclusions from these considerations. 
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2 The projection postulate and superluminal 
communication. 



We shall assume the existence of an EPR-type apparatus by which a physical 
system decomposes into two parts which then separate in such a way that 
future measurements on each part separately can be performed at space-like 
separation. To avoid the complications of Bose or Fermi symmetrization, we 
shall assume the two parts are not identical, nevertheless we posit that the 
internal degrees of freedom of each part are described by finite-dimensional 
hilbert spaces of the same dimension which is at least three. In what follows 
we assume that the wave function factors into a product of the spatial part 
and the internal part. We focus only on the internal factor. Let ei, . . . , e^v 
and /i, . . . , /iv be orthonormal bases for the hilbert spaces of the internal 
degrees of freedom. We shall then assume that one can prepare the composite 

system in the state = {l/y/N){fi ® ei + f2 <S) 62 + h /at (8) cn)- We 

also assume that for one arm of the apparatus (hereafter referred to as arm 
A), given any basis hi, . . . ,hN for the corresponding hilbert space, we can 
measure a non-degenerate observable with this eigenbasis. Swift and Wright 
TM have argued that for certain spin systems one can physically prepare 



a state corresponding to any ray in the corresponding hilbert space, and 
construct an actual apparatus corresponding to any self-adjoint operator. 
We shall thus generally assume that there are apparatus corresponding to 
any self-adjoint operator that we consider. We chose a reference frame in 
which the observation on arm A is temporally prior to that on arm B and, 
under the Neighborhood Hypothesis, and the assumption that superluminal 
communication is impossible, determine what must hold at arm B. 

We first note that = ^Uijhj for some unitary matrix u. This means 
that \I' = (l/-\/iV) Hdj ® hj where Qj = J2uijfi defines another orthonormal 
basis, and any basis can be so constructed. Under the usual projection 
postulate, if we now perform observations corresponding to a non-degenerate 
self-adjoint operator with eigenbasis hi, . . . ,h]\f on arm A, then subsequently 
the state collapses to a mixture in equal proportions of product states gj^hj, 
and consequently the part of the system that is in arm B can be construed 
as being, in equal proportions, in the well defined quantum states gj. Let 
us now locate a detector on arm B of the apparatus and suppose that the 
detection rate for a pure state is -D(0). The detection rate on arm B 
is then {1/N)J2 D{gj). For superluminal communication to be impossible 
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this number must be independent of the orthonormal basis gi, . . . ^g^ since 
otherwise the person stationed on arm A could, by changing his observable, 
induce a change in the detection rate on arm B at a space-like separation. 



This means by Gleason's theorem [15|, that there is a positive operator R 



such that D{(j)) = {(f), R(f)). Since an observable can be construed as the 
simultaneous action of a number of mutually exclusive compatible detectors 
(one for each "pointer position"), we also conclude that on arm B, the mean 
value of any observable in state must be given by the usual quantum 
mechanical formula (0, Acp) for some self-adjoint operator A. As far as the 
expectation values of observables are concerned, arm B must then also follow 
the usual rules. Let us now consider on arm B a state transformer that 
transforms a state into another, possibly mixed state. Let us indicate 
this action on the corresponding density matrices as i— Tp^, where = 
(0, ■)(/) and T is a-priori an arbitrary map. If, after we subject the incoming 
state to transformation T, we then perform an observation corresponding 
to a conventional quantum observable represented by the operator A, the 
resultant expected value is Tr{ATp^) and by our previous result this must 
be of the form (0, r(74)0) for some self-adjoint operator t{A). Choose now 
an orthonormal basis ki, . . . ,kp for the space in which the transformed states 
lie (which may be different from the space in which lies, as would be the 
case for the putative "photon doner"). Setting A = (l/2)((A;p, ■)kq+{kq, ■)kp) 
and then A = (l/2z)((/cp, ■)kg — {kg, ■)kp) one deduces that Re {kp,Tp^kq) = 
(0, Mpgcj)) and Im {kp, Tp^kq) = (0, NpqCp) for some self-adjoint operators Mpq 
and Npq with Mpq = Mqp and Npq = —Ngp. If we now set Lpq = Mpq + iNpq 
we have (/cp, Tp^kq) = (0, LpqCp) and finally 

^P'P = J2(^^^pq^)(Kr)K = J2Tr{Lpqp^){kq,-)kp 

but this is a linear function of p^. We thus conclude that at arm B any 
density matrix state transformer must be linear. Finally, suppose the state 
transformer does not turn pure states into mixed states, then we can write the 
transformation as i-^ 5*0. Now by the previous result one has that (0, ■)0 ^— >■ 
(5*0, ■)S'0 must be a restriction of a linear map and given by the formula 
above. We have been able to prove a theorem (see the appendix) which states 
that in this case, if the range of S is not confined to one ray, (5*0, ■)5'0 = 
(C0, ■)C0 where C is either a linear or an anti-linear t ransformation. Non- 
degenerate state transformers on arm B that do not create mixed states 
from pure ones must therefore be representable by either linear or anti-linear 
mappings. Summing up: 
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Under the Neighborhood Hypothesis and normal quantum me- 
chanical processes in relation to one arm of an EPR-type appara- 
tus, and assuming the impossibility of superluminal communica- 
tion, then at the distant other arm 1 ) the mean value of observ- 
ables must be given by the usual quantum mechanical expectation 
value formula 2) density-matrix state transformers must be linear 
and 3) vector state transformers whose range contains more than 
one ray must be either linear or anti-linear. 

The wayward transformers are those of the form Scj) = 9{(l)){(l), D(f)y^'^ip 
for some positive operator D, vector ip, and unimodular function 6. Note that 
{S(j), AScp) is just (0, D(l)){ip, Alp) and so no measurement on Scj) provides any 
more information about than that aheady given by the matrix elements 
(0, D(f)). Hence the transformer is no more than the observable D in disguise, 
no real information about being passed on to the transformed state ip. This 
degenerate case is thus of little further interest. 

The analysis of this section doesn't go through for quantum systems 
described by a two-dimensional Hilbert space as Gleason's theorem does 
not then hold. Nevertheless, as Nick Herbert's example shows, certain 
non-linearities can still lead to superluminal communication. We have not 
determined what the exact restrictions on non-linearities are in the two- 
dimensional case as these are specialized physical situations. Although logi- 
cally tenable, the idea that physics changes radically once a physical system 
described by a two dimensional Hilbert space gets isolated from the environ- 
ment, is to allow what must be deemed a rather bizarre situation. From the 
still informal viewpoint of a neighborhood of standard quantum theory, such 
theories would not constitute a generic situation and are thus left out of this 
first analysis. 

What this all suggests is that some of the gross features of the quantum 
mechanical formalism are intimately connected to spatio-temporal relations. 
In particular, if part of the world follows standard rules, then the rest must 
follow suit if superluminal communication is to be ruled out. There cannot 
be any "small deviations" . To what extent the impossibility of superluminal 
communication, per se, limits any possible physical theory is not clear. One 
is aware of the great difficulties in trying to justify the quantum formalism 
on clear a-priori physical grounds. The existing axiomatic schemes, of which 
the two most developed are due to Ludwig and Piron |T^, all suffer from 



the defect that the crucial axioms are by no means compelling nor even clear 
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as to their true physical content. There is now the intriguing possibihty that 
some spatio-temporal hypothesis such as the impossibility of superluminal 
communication could result in hilbert-space quantum mechanics in a more 
natural way. We begin to address this question in Section || of this paper. 

Various researchers have expressed the hope that some paradoxes of quan- 
tum mechanics (especially those connected with "Schrodinger's Cat" , that is, 
measurement theory) would eventually be resolved by non-linearities (which 
presumably become more important as the number of particles increases) 
in the time evolution [1^. We see now that any such non-linearity carries 
with it the real possibility of superluminal communication and its concomi- 



tant problems. Steven Weinberg M, 20 1 has proposed a non-linear quantum 



theory which in principle could be tested experimentally by effects such as 
spectral line broadening. Any such effect could be immediately used to build 
a superluminal communication device. That this is possible for the Weinberg 
theory has already been shown by Gisin who has independently come 



to conclusions similar to the ones presented in this paper. That there is a 
connection between quantum evolution and superluminal communication has 
been pointed out in various articles by Gisin p2|, |2^, |4|, and by Pearle 
p5| , |26[] . Our result shows the generality of such a connection, it essentially is 
a consequence of the projection postulate and the structure of hilbert space, 
concretely Gleason's theorem. 

Deeming superluminal communication undesirable, one can try to avoid 
it, while maintaining non-linearity, by modifying the projection postulate. 
While logically plausible, there are at present no known examples of a rel- 
ativistic non-linear quantum mechanics. We discuss elsewhere the pos- 
sibility of such a theory. In recent years there has been a growing interest 
in investigating non-linear evolution in spite of the reasons, such as those 
discussed in this paper and elsewhere that have been brought forth 
against it. The feeling seems to be that an appropriate reformulation of 
the measurement process would eliminate the difficulties. Investigations by 
G. A. Goldin, H.-D. Doebner, and P. Nattermann p9| , |3T| show that by a 
reasonable restriction on the set of allowed measurements, certain non-linear 
Schrodinger equations are then observationally equivalent, via a non-linear 
"gauge transformation" to the free linear equation. This shows that non- 
linearity per se may not lead to superluminal signals under a reasonably 
modified measurement postulate. As the equation studied by these authors 
are non-relativistic, we are still far from understanding the true relation of 
linearity to relativity. 
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3 Quantum indeterminism and superluminal 
communication. 



The so called "quantum indeterminism" also seems to be connected with 
spatio-temporal relations. To see this, still under the Neighborhood Hypoth- 
esis, consider the already familiar EPR situation for the singlet two-photon 
states: there is a photon in each arm of the apparatus, and the combined 
state is the singlet: (1 / -\/2) (H 0V + V H) where V indicates vertical linear 
polarization and H the orthogonal horizontal one. Consider a linear po- 
larizer. The usual assumption is that the passage of a non-polarized photon 
(such as one of the photons of the singlet state) through this polarizer is abso- 
lutely random. If we write down a sequence of O's and I's where means the 
photos is absorbed, and 1 passed through, then this sequence is thought to 
be at least a von Mises random sequence [32| in that no computable function 
that tries to predict the next outcome on the basis of outcomes already seen 
does better than chance. Suppose now that besides such a "randomizing" 
polarizer one also has a non-randomizing one for which some computable 
predictor does better than chance. Let one person then stay on arm A of the 
EPR apparatus armed with the two kinds of polarizers, which he may place 
in a horizontal orientation. Let his colleague stay at arm B with a polarizer 
placed in the vertical position, and let him use the computable predictor to 
guess whether the photon passes or not. Due to the strict mirror correlations 
in the singlet state, the sequence seen at arm B is the same one that is seen 
at arm A. Suppose that at arm A the randomizing polarizer is placed, then 
at arm B the predictions are seen to be no better than chance. Now one 
places the non-randomizing polarizer at arm A and shortly thereafter one 
sees at arm B better than chance predictions. If the arms are sufficiently 
distant this constitutes superluminal communication. Thus if quantum in- 
determinism exists in part of the world it must be universal if superluminal 
communication is to be ruled out. Once again, under the assumption of the 
impossibility of superluminal communication, distant quantum correlations 
have the effect of universalizing gross features of quantum mechanics. As 
before, it is not at all clear at this stage if the impossibility of superluminal 
communication, per se, imposes von Mises randomness on quantum events. 
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4 The second law of thermodynamics and su- 
perluminal communication 

In Sections |^ and ^ we showed that assuming the impossibihty of superlumi- 
nal communication, and given the projection postulate, one can deduce that 
certain gross features of the quantum formahsm must be universal and not 
admit any deviation. In this section we explore the relation of the hypothesis 
to thermodynamics by showing that given lorentz covariance, the second law 
of thermodynamics precludes non-entropic superluminal communication. 

The second law and the hypothesis of the impossibility of superluminal 
communication have many superficial similarities. The most important of 
these is that both are statements about general physical objects of arbitrary 
size and complexity, stating that no such object can perform a certain task. 
In the case of the second law the task would be (among many equivalent 
formulations) to extract useful energy from heat, in the second case that of 
transmitting a message across a space-like interval. Superluminal communi- 
cation per se is innocuous and only becomes problematic if its underlying 
physics is required to be lorentz covariant. In this case, it is well known that 
one can then send a message from a point on a time-like world-line to another 
point on the same world-line in the causal past of the first one (communi- 
cation to one's past). It would then be possible for me to receive a message 
signed by me and dated tomorrow that reads: "Under no circumstances send 
this message!" Believing in free will, I take a firm resolve to obey the request. 
Am I then coerced in spite of my resolve to send it? If so, what happened 
to free will? If not, how did I happen to receive it? Superluminal communi- 
cation in a lorentz covariant world is fraught with such causality paradoxes. 
In what follows we shall hypothetically assume such retrograde messaging 
in contexts where the paradoxical nature is either minimized or absent, tac- 
itly supposing that its more bizarre manifestations get somehow resolved or 
avoided through mechanisms or circumstance yet unspecified. Presumably 
a consistent lorentz-covariant mathematical theory involving superluminal 
communication might resolve the paradoxes by imposing adequate boundary 
conditions on the solutions of the dynamic equations. 

The second law is also about entropy, saying that it does not decrease in 
closed systems. Entropy is intimately tied to information. Communication 
is the transfer of information. It is then not surprising that if information 
can flow from the future to the past, reversing its normal direction, then 
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entropy in a closed system where such messaging takes place can be made to 
decrease, also reversing its normal direction. 

In contemplating the thermodynamics of a system involving information 
transfer, it is necessary to take into account the entropy created in the pro- 
cess. Now a functioning communication channel should not in principle create 
entropy. To be clear, we are here considering the channel mechanism itself 
and not the processes involved in getting the message into and out of the 
channel. All such known channels involve the transfer of energy or matter 
(think of television or the postal service) and actually create entropy, but this 
entropy is incidental to the actual information transferred and comes about 
through inefficiencies that in principle can be eliminated or minimized below 
any preassigned level. One can communicate by a conservative and purely 
mechanical device. All that is needed is that the observed motion, invoked 
by the sender, have a meaning to the receiver. There are no known chan- 
nels of superluminal communication. The hypothesized schemes or involve 
tachyons whose existence has not been established, or EPR-type correlations 
which cannot be used for communication unless, as was shown in Section |^, 
the hilbert-space quantum formalism breaks down, for which likewise there 
is no evidence. But even in such hypothesized channels there is no evident 
reason for energy degradation to be necessary. We shall therefore assume ini- 
tially that our putative superluminal communication process does not create 
entropy, or at least that such entropy can, for a given message, be reduced 
as much as necessary. There is though some unavoidable entropy creation 
involved in observing a system (to get the necessary information that will 
become the content of the message) as has been pointed out by Brillouin 



33|, |3^. As the neglect of this type of entropy would lead to second-law 



violation, via a Maxwell's demon type argument, even without positing su- 
perluminal communication, these entropy sources must be considered. 

We shall begin our discussion then with the assumption of a non-entropic 
superluminal communication process whose underlying physics is lorentz co- 
variant. We then construct a communicator that sends messages to the past. 
This generally is achieved by signaling between different inertial frames, but 
this can be done by conventional means and so we can continue to posit that 
no entropy is created. 

There are many (unworkable) proposals to foil the second law using ther- 
modynamic fluctuations and one-way mechanical devices (ratchets, valves, 
etc.). The idea is that the device would be activated by a fortuitous but in- 
evitable fluctuation, and the one-way nature of the device would prevent a re- 
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verse motion thereby storing energy (in a spring say) which is then extracted 
and used. One need only then wait for the next fluctuation to continue the 
process of extracting useful energy from heat. All such schemes fail. They 
seem plausible since one generally overlooks details of exactly what happens 
when the fluctuation activates the device. Surprisingly enough many, if not 
all, such proposals can be made to work if one can anticipate events, and 
retrograde communication makes this possible. Let us contemplate then one 
such proposal. Consider an enclosed rectangular volume with a rigid parti- 
tion in the middle dividing the volume into two chambers (denoted by A and 
B) filled with a gas at the same pressure and temperature. Let us put a small 
one-way valve on the partition, the mobile element of which is held in place 
by a spring. If the local pressure on side A of the valve exceeds that on side 
B by some threshold difference then the force of the spring is overcome and 
the valve opens. Pressure excess on the other side has no effect due to the 
one-way nature of the mechanical construction. The (fallacious) argument is 
that due to pressure fluctuations the valve would open from time to time and 
allow gas molecules to pass from the supposedly (local) higher pressure side 
to the lower pressure side transferring molecules from chamber A to B and 
building up a true pressure (and temperature) difference violating the sec- 
ond law. The argument falls down because the molecules that push the valve 
open have already recoiled and are not the ones that actually pass through 
it. These, passing from both sides, have such a distribution of velocities that 
no pressure difference is built up. If however it were known beforehand when 
the valve would be pushed open, then one can simply temporarily remove 
the valve, exposing thus an orifice on the partition, and then the exchange 
of molecules across the orifice would be such that those passing from A to 
B would have a larger mean velocity than those passing in the other direc- 
tion. Such information can be had if retrograde communication is possible 
and the second law can then be violated. This is essentially a variant on 
the Maxwell's demon scheme. The usual Maxwell's demon cannot violate 
the second law, since the process of observing the velocity of molecules, if 
done by actual photons, creates more entropy than the decrease achieved in 
operating the orifice closure to selectively let molecules pass from one side 
to the other |53|]. The observing photon is necessarily of higher energy than 



the mean kinetic energy of the molecules and this thwarts the attempt. Our 
demon (observer) does not observe the molecules, all he need become aware 
of is a normal consequence of a fluctuation, such as a valve opening, and this 
can be done with low energy photons since the dimensions of the valve are 
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much larger than those of the molecules and it's motion is much slower. 

This scenario does have its paradoxical aspects (inevitable consequence 
of the conjunction of lorentz covariance and superluminal communication). 
If the observer has free will and receives the correct information that at some 
instant the valve will be pushed open, then by removing the valve during a 
small time interval he destroys the chance of observing the valve being acti- 
vated. He can still send the message to his past to maintain consistency with 
the fact of having received it (not doing so creates an even bigger paradox) but 
then the message has no real basis in observed events. This is self-consistent 
but hardly satisfying. To maintain some semblance of the ordinary order of 
things one can assume that the valve itself has an aperture that can be opened 
and closed and that it is small relative to the size of the valve so that even 
with the aperture open, pressure fluctuations still activate the valve. The ob- 
server now only send a message if he sees the valve activated (having already 
also opened the aperture by request of the message received) . This still works 
as fluctuations have a certain size which gives a greater probability to trans- 
ferring, via the aperture, higher velocity molecules from A to B and lower 
velocity ones in the opposite direction than to doing the opposite. Having 
reached this stage we can now actually eliminate the Brillouin "observer." 
One can automate the whole communication and valve operating process. 
The mechanism that receives the retrograde message can be coupled directly 
to the mechanism that opens the orifice. The message content itself can be 
reduced to a single bit, whose meaning would be "exactly T seconds from 
now open the orifice" . One would need to keep an accurate clock and operate 
a device based on it, but this again in principle need not create entropy. The 
whole gadgetry surrounding the retrograde communicator can be taken to 
be a conservative system. The act of observing the valve can be eliminated 
by mechanically coupling the valve mechanism itself to the communication 
system. At this point the paradoxical nature of retrograde communication is 
practically absent. Our Rube-Goldberg-like gedanken experiment should of 
course be taken as a sort of visualizable enactment of a physical process that 
would be governed by a precise mathematical theory (using both advanced 
and retarded potentials, say), if indeed it were desirable to construct such a 
theory. 

The assumption that the superluminal communication channel does not 
create entropy is crucial to the above argument. One may think that this 
hypothesis is not essential, arguing that what one need do is simply operate 
the scheme in those circumstances where the entropy decrease achieved (by 
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anticipating an adequate fluctuation, say) is greater than the entropy gain 
in using the channel. Although with specific assumptions about entropy 
created by the channel one can carry this through, there doesn't seem to be 
any general argument, at least we haven't been able to find one. We therefore 
tentatively conclude that one must necessarily impose restrictive assumptions 
on entropy creation in the channel in order to violate the second law. 

Let us now summarize the hypotheses that we are relating: (LC), lorentz 
covariance; (SL), the second law of thermodynamics; (ISC), the impossibility 
of superluminal communication; (INESC), the impossibility of non-entropic 
superluminal communication, and (UQFCH), the universality of the quantum 
formalism with the collapse hypothesis. 

We have finished showing: {LC and ~ INESC =^ ~ SL) which 
we prefer to state as: {LC and SL ^ INESC). In Section ^ we have 
proved: {ISC =^ UQFCH). In contrapositive the last two implications read: 
(~ UQFCH ISC) and (~ INESC ^ LC or ~ SL). What prevents 
linking these into a single implication is of course the non-entropic hypothesis, 
but in any case the two results clearly show what quantum breakdown entails. 

Suppose that one makes an experimental test of quantum mechanics and 
finds a discrepancy between the experiment and the quantum prediction. 
This would typically be of the form < R >txp^ Ar4^) where the left-hand 
side is the experimental mean value of some observable R in the state 0, 
and Ajj is the self-adjoint operator that has been associated to this observ- 
able. Now one may be able to find a self-adjoint operator Bji such that 
for all the states tested one has < R >fxp= {(p^BRcf)). In this case the dis- 
crepancy would most probably be explained by saying that, for some reason 
or other, the relation previously established between the observable R and 
was in fact wrong and the correct relation should be to the operator 
Br. Quantum mechanics would be preserved, "discrepancy" would then be- 
come "effect" and receive some usual explanation within the formalism as 
due to some new interaction energy, particle, etc. As such effects accumu- 
late one can even foresee the possibility of a "keplerian" revision in which 
an equivalent formalism gives an account of the observed physics in a more 
elegant simplified form, but this would not be a true subversion of quantum 
mechanics as we know it. If however the observed phenomena cannot be 
explained by using a different self-adjoint operator, then, under the assump- 
tions of Section |^, superluminal communication becomes reality. Once this 
happens, causal paradoxes arise if one is to maintain lorentz covariance. If in 
addition the superluminal channel is non-entropic, then one must definitely 
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abandon either lorentz covariance or the second law. In any case lorentz co- 
variance is seriously threatened. Barring this, the projection postulate must 
be modified. In short, quantum mechanics will either persist essentially in 
its present form with absolutely no discernible deviation, or there will be a 
radical revision of physics. There cannot be any small revision. Quantum 
theory, thermodynamics, and lorentzian space-time are so interrelated that 
what affects one affects all. 

And independent argument relating the second law of thermodynamics 
to locality (absence of superluminal infiuences) has been brought forward by 
Elitzur without however expliciting the non-entropic assumption con- 
cerning the mechanisms of the hypothetical infiuence. 



5 The covering law and superluminal com- 
munication. 

The results of Section ^ are "local" in nature in that they shows that no 
deviations of certain types from ordinary hilbert space quantum mechanics, 
no matter how small, can be allowed if the impossibility of superluminal 
communication is to be maintained. But one is then immediately led to ask 
the interesting "global" question as to whether the impossibility of superlu- 
minal communication can be used as an axiom leading to hilbert space itself. 
This would provide a truly physical basis for hilbert space quantum theory 
motivated by the space-time condition of lorentz covariance. In this section 
we present some preliminary results in quantum logics that show that such 
a thesis is plausible. In a subsequent publication we shall present further 
evidence, which does not appeal to signals, connecting lorentz covariance 



and the hilbert space structure. We know from Piron that a general- 
ized hilbert space model can be constructed if the quantum logic obeys the 
so-called covering law. It is this law that we shall try to substantiate by 
space-time considerations. A possible relation between the covering law and 
the non existence of superluminal signals was also independently postulated 
by Nicolas Gisin (private correspondence) who derived the covering law from 
a condition similar to our CNS below. 

Now whereas in Section |] given the collapse hypothesis, we derive facts 
about detectors from the structure of hilbert space, and the hypothesis of 
the impossibility of superluminal communication, to prove anything in the 
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reverse direction seem to be impossible without positing something about 
the detection process, for otherwise we can simply and circularly define de- 
tections as those processes that do not allow long-range correlations to be 
used to send superluminal signals. This means that one must append to 
the axioms of quantum logic some postulates about the detection process. 
Fortunately, enough such expositions already exist in the literature and we 



shall simply appeal to some of these introduced by Guz ^ ^] to make 
our point. In this we adopt Guz's notation to facilitate comparison with the 
cited works. 

We start off conventionally with an orthomodular poset L of physical 
"propositions", and a convex set S of probability measures on L which are 
to represent "physical states" . The set P of extreme points of S correspond to 
pure states. We assume that to each pure state p there is an atomic "indicator 
proposition" s{p) that singles it out in that p{s{p)) = 1 and q{s{p)) < 1 for 
q ^ p. The map p ^ s{p) is assumed to provides a bijection between the 
pure states and the atoms of L. 

Measurements are to be described by a "collapse" scheme pretty much 
modeled on the notion of an ideal measurement in the conventional quantum 
formalism. In this view, if a proposition a is tested in a pure state p and 
found to be true, then the state is transformed ("collapses") into a new pure 
state Pa- This collapse occurs with frequency p{a). This scheme is subject 
to the following conditions: 

1. Propositions that test with certainty do not collapse the state. That 
is. Pa = P whenever p{a) = 1. In particular, pi = p and Ps(p) = P- 

2. A repeated measurement yields the same results. That is, s{pa) < a. 

For two pure states p, q the number {p : q) = p{s{q)) is called the transition 
probability of p to g and corresponds to the fraction of times p collapses to 
q when tested by the indicator proposition of this last state. One sees that 
{p : q) = 1 implies that p = q. 

We shall assume that given a set of pair-wise orthogonal propositions 
{bi, . . . ,bn} there is a physical apparatus that tests them simultaneously 
with mutually exclusive outcomes. If the state is p E P then the outcome 
that renders proposition bj true and all the others false occurs with frequency 
p{bj) = {p : ph ). Note that in this case the "detection rate" of the whole 
apparatus is Y.]=i{p ■ Pb,) = E"=iP(&i) = p{bi V ■ ■ ■ V If V"=i &i = 1, 
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then after the state passes through the apparatus it becomes the mixed state 

n 

Let us now contemplate again an EPR-type space-time situation in which 
one has a state p, at one site A an apparatus corresponding to a proposition 
a & L, and at a distant site B an apparatus corresponding to the pair-wise 
orthogonal set of propositions {61, ... , 6„} with Vj=i bj — 1. The arrange- 
ment is to operate in such a way that the events corresponding to registries 
in the apparatuses are space-like separated. We thus assume that the propo- 
sitions a along with the hj form a commuting set. For there to be no signals 
from site B to site A due to correlations present in state p (we will call this 
the "no-signal hypothesis" ) , the detection rate at A must be independent of 
the apparatus used at site B. 

Now it doesn't seem possible to deduce the covering law just from the 
no-signal hypothesis. An extension of this hypothesis however to any situ- 
ation formally as above where only commutativity (and not just space-like 
separation) is assumed does lead to the covering law. 

Commutative no-signal hypothesis (CNS): Let a,bi, . . . ,bn ^ L be 
a commutative set and suppose the bj pair-wise orthogonal with 
Vj=i &j = 1; then J2]=i{p '■ Pbj)Pbj{o,) is independent of the set 

{61,..., 

This can be viewed as saying there is no "statistical quantum contextualism" , 
that is, the detection rate of a proposition is independent of what compatible 
mutually exclusive and exhaustive set of propositions one has measured just 
prior to it. If this set of propositions is measured at a space- like separated 
site, this follow from the no signal hypothesis. 

Now one particular set of bj that one can pick is the singleton {1} for 
which the above sum is (p : pi)pi(a) = p{a) and so the number whose 
independence is posited has to be p{a). Let now a and h be two commuting 
propositions, then from the CNS we deduce: 

{p ■ Pb)Pb{a) + {p : Pb')Pb'{a) = p{a) 

{P ■ PbAa')PbAa'{a) + {p ■ PbAa)PbAa{a) + {p ■ Pb')Pb'{.o) ^ p{a) 
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from which using the fact that PbAa'io,) = and PbAa{ci) = 1 we deduce: 
{p '■ Pb)Pb{ci) = {p : PbAa) which is the same as 

{P ■■ Pb){Pb ■■ {Pb)a) = {P ■■ PaAb) 

which says that the detection rate with the successive observations of h fol- 
lowed by a is equal to the detection rate of the observation of a A 6. By 
complete symmetry we also have {p : Pa){Pa ■ {Pa)b) = {p ■ Pahb)- 

Consider now h = s{qa) where q is any other state. As 6 < a, the two 
commute and we can apply the above results. Now, since b is an atom, for 
any state r one has = s~^{b) = qa provided r(6) ^ 0. Assume p{b) ^ 0, 
then by the last formula of the previous paragraph one deduces {p : Pa){pa '■ 
Qa) = {p ■ Qa)- If now {p : Pa) = {p : Qa) one concludes that {pa : qa) = I 
which implies that Pa = qa thus one has shown 

{P '■ Pa) = {P ■ qa) ^ ^ Pa = qa- 

What is interesting about this result is that this is precisely the property 
one needs in the axiomatic scheme presented by Guz |^ to deduce the cov- 



ering law for the poset L. Thus if one can somehow justify the extension 
of necessary conditions on space-like separated measurement to generally 
commuting measurements, the covering law would follow from the no-signal 
hypothesis. Lacking this, we have a weaker result in that a stronger hy- 
pothesis (CNS), leads to the covering law in at least one existing axiomatic 
framework. This axiomatics was not, to our awareness, conceived to estab- 
lish a connection between space-time structure and the covering law and as 
such may not be the best to make such a connection plausible. We shall 
address this problem in a future publication. There is a general weakness 
in all present axiomatic schemes for quantum mechanics, they do not dis- 
tinguish space-like separated propositions from other commutative pairs, for 
the only relation that one normally posits between space-time and quan- 
tum logic is that space-like separation leads to commutativity. Only by a 
joint axiomatization of both lorentzian space-time and hilbert space quan- 
tum mechanics can one hope for anything better. Now commutativity is 
generally interpreted as "commensurability" and this means the ability to 
make simultaneous measurements which in turn suggests that space-like sep- 
arations always play some role in commutativity, yet this has never been 
thoroughly examined. Furthermore if one believes in general unitary sym- 
metry in hilbert-space, it's not hard to find examples in which a commutative 
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pair of observables pertaining to a single localized system (a bound pair of 
particles say) is unitarily equivalent to a pair of observables at the opposite 
arms of an EPR apparatus. Such a symmetry would thus extend the re- 
quirements on space-like commutativity to commutativity in general. That 
arguments pertaining to space-like commutativity can be carried over to gen- 



eral commutativity has already been pointed out by Home and Sengupta |39 



in relation to Bell's inequalities. It may thus well be that the weaker result 
with the CNS assumption is not too distant from the desired strong result 
especially if one imposes strong symmetry requirements. In any case, it is 
clear that one cannot hope to reach a full understanding of hilbert-space 
quantum mechanics without linking it to space-time structure. 



6 Conclusions 

In previous sections we gave some arguments in support of the thesis that, 
given certain hypotheses, standard quantum mechanics is structurally unsta- 
ble, and neighboring theories generically run into difficulties with relativity. 
This spells trouble for those that feel that standard quantum mechanics is 
only an approximation and propose alternatives. Great care must be taken 
in formulating these alternatives. Some have even come to feel that all al- 



ternatives are ruled out. Stephen Weinberg in his book ||4^ has tentatively 
reached this conclusion: 

... I could not find any way to extend the nonlinear version of 
quantum mechanics to theories based on Einstein's special theory 
of relativity . . . both N. Gisin in Geneva and my colleague Joseph 
Polchinski at the University of Texas independently pointed out 
that . . . the nonlinearities of the generalized theory could be used 
to send signals instantaneously over large distances. . . At least for 
the present I have given up on the problem; I simply do not know 
how to change quantum mechanics by a small amount without 
wrecking it altogether. 

This theoretical failure to find a plausible alternative to quantum 
mechanics, . . . suggest to me that quantum mechanics is the way 
it is because any small change in quantum mechanics would lead 
to logical absurdities. If this is true, quantum mechanics may be 
a permanent part of physics. Indeed, quantum mechanics may 



18 



survive not merely as an approximation to a deeper truth, . . . but 
precisely valid feature of the final theory. 



A careful analysis of what really goes into the perception of this rigidity 
reveals the striking role played by the projection hypothesis or some modifica- 
tion thereof. This hypothesis in turn is based on the notion of instantaneous 
physical state which obviously is a notion tied to a given inertial frame. It is 
this frame-dependence that cannot be reconciled with relativity in the alter- 
native theories. Abandoning such a frame-dependent notion would mitigate 
arguments against alternatives and opens up a true possibility for changing 
quantum mechanics "by a small amount" (as far as numerical predictions are 
concerned) without "wrecking it altogether" (maintaining relativity). Wein- 
berg's reasons for his final speculation on the survival of quantum mechanics 
may in the end not be all that compelling. 

In recent years there has emerged a new quantum mechanical formalism 
(along with several interpretations of it) based on consistent histories and 
decoherence |]4ll, |4^, that essentially does away with the projection postulate 
and to a large extent with the notion of instantaneous state. As such, it 
escapes the analysis of this paper and shows a very promising possibility 
for those that wish to modify quantum mechanics and still maintain special 
relativity. We shall investigate these questions in a future publication. 
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A Appendix: The theorem 

We start with some notation and preliminaries. Our hilbert spaces are com- 
plex and finite dimensional. For a hilbert space H we denote by L{H) the 
space of operators in if, and by I the identity map. If e e if we denote 
by |e| the equivalence class of e under the relation by which two elements 
e and / are equivalent if and only ii e = Of for some unimodular complex 
number 6. We denote by Sh the resulting quotient space. A positive rank- 
one operator is necessarily of the form (e, ■)€ with e determined only up to a 
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unimodular multiple. Thus the space of positive rank-one operators is in a 
bijective correspondence with Sh minus the class of the zero vector. We shall 
reserve the Greek letter p to represent a positive rank-one operator. A map 
G : Sh — »■ Sk can be lifted in infinitely many ways to a map G : H ^ K 
by associating to e G if an arbitrarily chosen element of the equivalence 
class G\e\. We say that G represents G. Two such representations Gi and 
G2 are related by Gif = 6{f)G2f for some unimodular function 6. If G 
can be chosen to be linear or antilinear, we say that G is linearly or anti- 
linearly representable. We reserve the Greek letter 6 to indicate unimodular 
functions. We denote by [/i, f2, ■ ■ ■ , fq] the linear subspace generated by the 
indicated vectors. We use the superscript star symbol (■)* to indicate hermi- 
tian conjugate and, when the bar is inconvenient, the complex conjugate of 
numbers. 

Theorem 1 Let H and K he complex finite- dimensional hilbert spaces and 
W : L{H) —>■ L{K) a linear map. Suppose W maps a positive rank-one 
operators to either a positive rank-one operators or zero, then its action on 
positive rank-one operators is either of the form Wp = GpG* where G . H ^ 
K is linear or antilinear, or Wp = Tr{Dp){k, ■)k for some positive operator 
D e L{H) and keK. 

We shall call the three kinds of transforms as being of the linear, antilinear 
and degenerate type. 



This theorem is a relative of Wigner's theorem [^] which we use in part 
of the proof. 

Proof: If fci, . . . , A;„ is an orthonormal basis for K, then one can write the 
action of W as 

WA = J2Tr{AL,,){ki,-)kj 

for some Lij G L{H). 

The action of W on rank-one positive operators can be considered as a 
map Sh Sk- Let T : H K he a. map, that represents this action. One 
easily shows that replacing L^j with ^{Lij -\- L*-) one has the same action on 
Sh and so we can assume that L*- = Lji, in particular the diagonal elements 
La are self adjoint. We have \{ki,Tf)\'^ = {f,Liif), so each La is a positive 
operator and if we let L = J^Lu, we have ||T/|p = {f,Lf). By the spectral 
theorem, there is a positive invertible operator M such that MLM is an 
orthogonal projector. By working with TM instead of T, one can assume 
that L is a projector and we do so. 
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Now if dim H — there is nothing to investigate. If dim H — 1, generated 
by the unit vector e, then the L.^ are plain numbers. By performing a 
unitary change of basis in K, the matrix (Lij) undergoes a unitary similarity 
transformation and since it is self-adjoint, can be brought into diagonal form. 
Since its rank must be at most one, it is either zero and there is nothing more 
to consider, or it has a diagonal form with Lu = 1 and the rest of the elements 
zero. In this last case we have Wp = CpC* where C : H ^ K is the linear 
map defined by Ce = ki and the theorem is true in this case. 

Thus we next investigate the case that dimH = 2. If dimK < 2 we can 
extend X by a direct summand to achieve dimension 2, and extend T by zero 
into this summand. The case of dim K>2we leave for later. For dim K — 2 
we choose orthonormal bases in H and K and identify both spaces with C^. 

One now has: 

s_( aI + (3-a AI + B-a\ 
[^ij) - [A*I + B*-a ^1 + 6 -a ) 

where a are the usual Pauli spin matrices. Now Lu and L22 sum to a 
projector. If this is / then L22 = I — Lu, if of rank one, then, being positive, 
both Lu and L22 must be proportional to it, and if zero, both must be zero. 
In any case there is a basis of H in which both are diagonal and so we can 
assume that both (3 and S have only the third component non zero which we 
shall at an appropriate point denote simply by j3 and 6. Let -u be a vector in 
of norm one, then u*au is a unit vector in and any such vector can be 
so constructed. Denoting such a vector by h one has that {Tu, ■)Tu, in the 
chosen basis, is given by the following matrix: 

a + j3jh A + B-h\ 
A* + B*-n -f + 5-n ) 

This matrix must have rank at most one for any choice of n and so it's 
determinant must vanish. This gives 

{a + /3 • n)(7 + 6-h) = {A + B- + B* -n) 

for all n. In deriving conclusions from this expression one must be careful to 
symmetrize the coefficients of the quadratic terms in n and to subtract from 
them the trace contribution since one has n • n — 1. This now gives us the 
following relations: 

aj+^P-S = lAf + ^B^'-B, 
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a5 + -f/3 = A*B + AB\ 
l3,6,+6i(3j-p-66,, = B*B^+B,B]-'^B''-B6ij. 

If now L is zero then both Lu and L22 are zero and we deduce from the first 
equation that Lij = and there is nothing more to prove. We now use the 
fact that Pi = P6i3 and Si = SSi^ The above equations now give rise to: 



1^. 

«7 + ^PO 


= \A\^ + -B*-B, 
1 1 3 






(1) 


a6 + 7/3 


= 2Re{A*B3), 






(2) 


Re {A*Bi) 


= 0, 




{^ + 3), 


(3) 


2/5(5 


= 2|53p - - 






(4) 


Re{BtBs) 


= 0, 




(z^3) 


(5) 


-(35 


= 2|5ip - l^al^ - 










= 2|S2p-|5i|2- 


\Bs\', 




(6) 


Re {BIB2) 


= 0. 






(7) 



From one deduces that = \B'^. Now one still has the freedom to 
make a unitary change of basis in if by a diagonal matrix with unimodular 
entries. The effect of this is to perform a real rotation of the B^) vector. 
In this way one can choose the imaginary part of Bi to be rotated to zero. 
Thus we assume Bi = B a. real number, and by (|^) one has B2 = ±iB. 
There are now two cases: 

Case I: B ^0 

From (^) we have A = and from B^ = 0. Equations (^) and (^) are 
now equivalent and give P6 = —B"^ and this in equation (|l]) gives 07 = i?^. 
Using these in equation we deduce that = /?^ and 7^ = 5^. Now since 
the La are positive operators we have that a > and 7 > 0. The projector 
L has the matrix: 

fa + -f + (3 + 6 \ 

V a + 7-/5-5y' 

Since B neither a nor 7 is zero. Using now the fact that a and 7 are 
positive, that /3 and 6 have opposite signs, and that = /5^ and 7^ = 5^ we 
deduce that neither diagonal entry in the matrix can be zero. Thus L must be 
the identity. This means that (3 = —6 and so all four numbers must have the 
same modulus. Exchanging the two basis elements in K exchanges the role of 
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Lii and L22, thus without loss of generahty we can put a = /? = 7 = — 5 = |. 
Changing the sign of one of the basis elements of H results in changing the 
sign of the [Bi, B2) vector so we can choose B to be positive, in which case 
it must be \ by previous equations. Thus this case reduces to the following 
canonical forms: 

1 / 1 + 0-3 (Ti ± i(T2 ■ 

2 V o"! =F 1(^2 I - (y-i 
Case II: B = 

A unitary change of basis in K causes the representation for (Ljj) to 
change by subjecting the two matrices 

a A\ f [3 B, 
A* 7/ \B^^ 6 

to the same unitary similarity transformation. The two matrices are her- 
mitian, and so there is no loss in generality in assuming that the second is 
diagonal and thus B3 = 0. The two now equivalent equations and 
give = I-B3P = 0, and then from we get 07 = There are now two 
subcases: 

Subcase Ila: L is rank one. 

Without loss of generality assume that the second entry of L is zero, then 
we have a + 7 = /5 + 5 and since the first entry must be one, one has that 
both sums must be |. Since (36 = one of the two must be zero and the 
other i. Without loss of generality we set 6 = 0. Equation (Q) now gives 
7 = which finally implies that A = and we have the canonical form: 

l/J + CTg 

2 I 
Subcase lib: L = I 

We now must have (3 + 6 = which with (36 = means that both are 
zero. Thus the second one of the displayed pair of matrices above is zero and 
we are free to diagonalize the first one. In this case, from 07 = one sees 
that either a or 7 must be zero and the other 1. Without loss of generality, 
assume a = 1, and we have the final canonical form: 

/ 


Let us now examine the cases. One of the forms of Case I corresponds to 
Wp = I pi and the other to W p = JpJ where J is the antilinear involution 
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on given by .J{ui,U2) = (■^1,^2). The form of Case Ila corresponds to 
Wp = PpP where P is the projector on the first component. The last form 
cannot be expressed via a hnear or antihnear transformation and corresponds 
to Wp = Tr{p){ki, ■)ki. We see here the three types mentioned at the 
beginning: the linear, antihnear and degenerate. 

If we take into account the changes of basis that were made, and the fact 
that L was changed into a projector by considering TM instead of T we see 
that the general (non-canonical) form for the degenerate type (still with both 
dimensions two) is Tf = 9{f){f,Df)^/'^k for some positive definite operator 
D and some k e K. 

Consider now the general case of arbitrary dimension greater or equal to 
two for each hilbert space. Choose a two-dimensional orthogonal projector 
Q in K and a two-dimensional orthogonal projector P in H. One has that 
QTP is one of the three types. These local representations will now be joined 
into a global one. 

Firstly we shall keep P fixed, and by joining the representations of the 
QTP, find a representation for TP. For simplicity's sake we shall generally 
omit writing P in the next two paragraphs. 

Consider the degenerate type. Suppose that Kq C K is a. subspace with 
the property that if Qq is the orthogonal projector on Kq, then QqT f = 
9{f){f, DfY^'^g for some g E Kq and some positive definite operator D. 
If Kq ^ K, choose a vector h orthogonal to Kq. Let Q be the orthogo- 
nal projection onto the subspace [g,h] and Qg the projector onto [g]. Sup- 
pose that QT is either of the linear or antihnear type. Then one has from 
QgQoTf = QgQTf that 9{f){f,Dfy/^ = e\f)(l){f) for some linear or anti- 
linear form /. This is obviously impossible due to the positive definiteness of 
D. Thus the type must be degenerate and we have QTf = 9'{f){f, D' fY^'^k 
for some vector k and some positive definite operator D' . By the same token 
as before we must now have e{f){f , D ff'^g = e'{f){f, D' fy/^Qgk. This 
means that g = aQgk for some non-zero number a and 6'{f){f,D'fy/'^ — 
a9{f){f,Dfy/'^ Let Ki be the span of Kq and h, and Qi the orthogo- 
nal projector onto it. From Qi = Qo + Q ~ Qg, we get that QiTf — 
0if)if,Dfy/'g + e'if)if,D'fy/\I-Qg)k = eU)U\DfYl^ak which is pre- 
cisely the form that was assumed for Kq. By induction therefore, if any of 
the two-dimensional forms QT is degenerate then Tf — 9{f){f, Dfy/'^k, for 
some vector k & K and some positive definite operator D. 

Suppose now that all the QT types are linear or antihnear. If the di- 
mension of the linear span of the range of T in is two or less then we 
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are through since for some two-dimensional projector Q we would have 
TP = QTP. Suppose therefore that there are at least three linearly in- 
dependent vectors in the range and so choose three such ^1,^2, ^3 that are 
pairwise orthogonal. Let Qij be the orthogonal projection on [ki, kj]. One has 
QijTf — dij{f){4>ij{f)ki + '4^ij{f)kj) for hnear or antilinear functionals, as the 
case may be, (f)ij and -0^ ■ In particular one has that 0i2 and ■0i2 are linearly in- 
dependent and so ipi^ is a linear combination of the two or their complex con- 
jugates, say ^/'i3(/) = a</)i2(/) + &^i2(/) (the case ip^if) = a0i2(/) + &V^i2(/) 
is analogous). Now when (puif) = 0, Tf is proportional to k2 and so 
{k^,Tf) — which implies that i/Jisif) = 0' hence 6 = 0. One must 
thus have ■0i3(/) be proportional to (puif) or its complex conjugate. A sim- 
ilar reasoning leads to the conclusion that ip2?,{f) is proportional to i/^uif) 
or its complex conjugate. But {k^,T f) = 6'i3(/)V'i3(/) = 02z{f)i^2i,{f)- This 
proportionality contradicts, by the linear independence of 4>i2 and V'12, the 
previously proved proportionalities unless both '0i3,i ==1,2 are zero. This 
now contradicts the existence of the ki. Thus the dimension of the span 
cannot be greater than two and this case is done. 

We now pass to joining the local representations of TP to a global one of 

T. 

Suppose that for a two-dimensional projector P one has that TPf — 
9{f){f, DfY^'^g for some positive definite operator D and some vector g. 
Suppose that the range of T is not wholly contained in a one-dimensional 
subspace. Then there is a vector h with Th and g linearly independent. 
Consider the two dimensional subspaces [/, h] where / G PH. Since on these 
subspaces the image of T contains at least two linearly independent vectors, 
the type cannot be degenerate and we must have by either linearity or antilin- 
earitythatT(/+a/i) = e,{f,a)Tf+92{f,a)aTh = 9,{f,a)9{f){f,Dfy/^g+ 
92{f,a)aTh. (Note that a complex number and its complex conjugate differ 
by a unimodular factor, so both the linear and antilinear cases are subsumed 
in the last expression.) This now defines the action of T on the three dimen- 
sional subspace PH® [h]. The coefficient of {g, ■)Th in {T{f+ah), ■)T{f+ah) 
must, by the general expression for W, be of the form (/ -|- ah, N{f + ah)) 
for some linear operator N, and so we must have 

9,{f,a)9{f)92{f, a){f, Dff'^a = 

(/, Nf) + «(/, Nh) + a{K Nf) + la^h, Nh) 

Now the first and fourth term on the right-hand side must vanish since the 
left-hand side vanishes when either f or a vanishes. The modulus of the 
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left-hand side does not change if we multiply a by a unimodular number, 
and for this to be true of the right-hand side then, for a given /, one of the 
two middle terms must vanish. Thus as a functional of /, if one of {f,Nh), 
{h, Nf) is not the zero functional, then the other must vanish whenever the 
first one doesn't, which is only possible for the zero functional. Thus only one 
of the middle terms is not zero, which contradicts the positive definiteness 
of D since a linear or anti-linear functional always has a non-trivial kernel. 
Thus in our case the image of T is contained wholly in a one-dimensional 
subspace [k] and one must have, by the general expression for W, taking 
ki = k, that Tf = 9{f){f, Luff/'^k on all of H. 

We are now left in the last case of all the types of TP being linear 
or antilinear. Remember that we can assumed that L is a projector. For 
f E H, one has / = Lf + {I — L)f. Assume both terms are not zero. 
On the two-dimensional subspace spanned by the two terms, T is either 
of the linear or antilinear type, thus Tf = ei{f)TLf + ^2(/)T(/ - L)/, 
but ||r(7 - L)f\\ = ||L(7 - L)f\\ = Thus \Tf \ = \TLf\ and so TL rep- 
resents the same transformation Sh Sk that T does. One thus need 
only consider T on LH and so assume that L is the identity (of dimen- 
sion at least two, as the one- dimensional case was already settled). With 
this assumption, one has that on any two-dimensional subspace, T is repre- 
sentable by either a linear or antilinear isometry, and hence, by the polar- 
ization identity, preserves the moduli of inner products. Let ei, . . . ,e„ be 
an orthonormal basis for i7, then the Tei, . . . , Te„ are orthonormal. By lin- 
earity or antilinearity on two-dimensional subspaces we have T(X)" ttiCj) = 
T{aiei + Yyi (^i^i) = 0{a)aiT{ei) + 9'{a)T{J22 ca^i)- Continuing in this vein 
one has T{J2'^ aiCi) — 9i{a)aiT(ei). Thus the range of T is totally con- 
tained in the subspace generated by the Tei, . . . , Te„. By working only with 
the range, we can thus assume K and H are of the same dimension and so 
we identify them. By Wigner's theorem, any mapping of a hilbert space into 
itself which preserves the moduli of inner products is representable by either 
a unitary or an antiunitary operator. This completes the last case and the 
proof of the theorem. Q.E.D 
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